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PROPORTIONAL FAIRNESS AND ITS RELATIONSHIP WITH 
MULTI-CLASS QUEUEING NETWORKS 

By N. S. Walton^ 

University of Cambridge 

We consider multi-class single-server queueing networks that have 
a product form stationary distribution. A new limit result proves a 
sequence of such networks converges weakly to a stochastic flow level 
model. The stochastic flow level model found is insensitive. A large 
deviation principle for the stationary distribution of these multi-class 
queueing networks is also found. Its rate function has a dual form that 
coincides with proportional fairness. We then give the first rigorous 
proof that the stationary throughput of a multi-class single-server 
queueing network converges to a proportionally fair allocation. 

This work combines classical queueing networks with more recent 
work on stochastic flow level models and proportional fairness. One 
could view these seemingly different models as the same system de- 
scribed at different levels of granularity: a microscopic, queueing level 
description; a macroscopic, flow level description and a teleological, 
optimization description. 

1. Introduction. In this paper we form descriptions of multi-class single- 
server queueing networks at different levels of granularity. Similar descrip- 
tions of electrical networks have been well studied and provide a good ana- 
logue of the results proven in this paper. 

One could form a Markov chain model of electrons in an electrical net- 
work. At this first level, one explicitly describes the location of particles. 
Transitions within the network occur rapidly, so perhaps it is more natural 
to consider results like Ohm's law and Kirchhoff 's law which are concerned 
with the current flowing through the network. At this second level, one con- 
siders the average flow of particles through the network. An electrical net- 
work also minimizes energy dissipation, as described by Thomson's principle 



Received September 2008; revised April 2009. 
^Supported in part by EPSRC Grant GR/586266/01. 

AMS 2000 subject classifications. Primary 60K25, 60K30; secondary 90K15, 68K20. 

Key words and phrases. Multi-class queueing network, proportional fairness, band- 
width sharing, stochastic flow level model, insensitivity, product form stationary distribu- 
tion, proportionally fair, state space collapse. 

This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Applied Probability^ 
2009, Vol. 19, No. 6, 2301-2333. This reprint differs from the original in 
pagination and typographic detail. 



1 



2 



N. S. WALTON 



(primal) and Dirichlet's principle (dual). At this third and final level, one 
considers the network to be acting as an optimizer. For further discussion 
see Kelly [14], Section 2, and Doyle and Snell [7], Section 1. 

Just as we consider electrons, current and energy minimization in an 
electrical network, respectively, in a model of a packet switched network we 
consider packets, bandwidth and utility optimization. Following Shah and 
Wischik [24], we will use the terms microscopic, macroscopic and teleological 
to refer to these different descriptions of our network. Microscopic refers to 
a detailed description of a network's state though gives little insight into 
overall dependence. Macroscopic refers to an averaged view of the original 
network. Teleological considers a further abstracted view where the network 
can be seen to be acting globally as an optimizer. 

In this paper our microscopic model will be product form single-server 
multi-class queueing networks [11]; our macroscopic model will be a specific 
stochastic flow level model [2, 3, 8, 18, 19, 21] and our teleological model will 
be the proportionally fair optimization problem [15, 25]. The main results of 
this paper are concerned with forming rigorous connections between these 
different models. 

The first result of this paper is concerned with connecting our microscopic 
model to our macroscopic model. Stochastic flow level models for an intuitive 
model of document transfer across a packet switched network. Despite this 
no rigorous convergence proof has been constructed to justify a stochastic 
flow level model as the limit of a packet switching network. In Theorem 3.1 
we construct a proof to address this issue. We view a sequence of multi- 
class queueing networks as a simplistic model of document transfer across a 
packet switching network, and we prove weak convergence in the Skorohod 
topology of these networks to a specific stochastic flow level model. We call 
the resulting stochastic fiow level model the spinning network. The spinning 
network was first considered by Massoulie [21], Section 3.4, and formed the 
first insensitive stochastic flow level model. Insensitivity results on this model 
are given by Proutiere [21] and Bonald and Proutiere [3]. 

The second result of this paper is concerned with connecting both our mi- 
croscopic and macroscopic models to our teleological model. In particular, 
in Theorems 6.1 and 7.2, we give the first rigorous proofs of a mathemat- 
ical relationship between multi-class networks of single-server queues and 
proportional fairness. An argument justifying such a relationship has been 
made by Schweitzer [23], Kelly [13] and Massoulie and Roberts [19]. This 
noteworthy argument, presented in Section 5, considers the constraints that 
a queueing network may impose on transfer rates. Here we take a different 
approach and provide a rigorous proof using large deviations and convex du- 
ality. The use of large deviations is motivated by the relationship between 
balanced fairness and proportional fairness found by Massoulie [17]. In ad- 
dition, Pittel [20] has considered the large deviations of multi-class queueing 
networks but does not derive proportional fairness. 
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Fig. 1. Structure of the paper's results. 

To prove Theorem 6.1 we consider a large deviations principle for the sta- 
tionary distribution of a multi-class single-server queueing network and find 
a rate function /?(•). Applying the Contraction Principle, we gain a new rate 
function a(-), expressed as a convex optimization problem. In primal form, 
a(-) is interpreted as minimizing entropy subject to constraints. We find 
that the dual form of a(-) is, up to a constant, the proportionally fair opti- 
mization problem. These arguments give Theorem 6.1. With this we are able 
to prove Theorem 7.2. Theorem 7.2 states that the stationary throughput 
of a closed multi-class queueing network converges to a proportionally fair 
allocation as the number of customers (or packets) is increased in propor- 
tion to some fixed vector. Proportionally fair optimization occurs because 
the states of the queueing network collapse to a set of entropy minimizing 
states with proportionally fair throughput. These results are analogous to 
the heavy-traffic notion of state space collapse [4, 16, 24]. These results em- 
phasize a large deviations duality between network state and network flow. 
Please see Figure 1 for a diagram of the structure of these results and the 
sections that they are contained in. 

In addition, the multi-class queueing networks considered in this paper 
are known to be quasi-reversible and thus have a product form equilibrium 
distribution [11]. The equivalence between reversibility and insensitivity is 
well studied [3, 21, 22, 26]. Our macroscopic model, the spinning network, 
inherits the reversibility property from these multi-class queueing networks 
and thus, as has been observed by Massoulie, the spinning network is in- 
sensitive. Thus for multi-class queueing networks we find at each different 
level of granularity a different feature of the system can be observed: prod- 
uct form at the microscopic level; insensitivity at the macroscopic level and 
proportional fairness at the teleological level. 

Observations on insensitivity and the product form stationary behavior of 
proportional fairness have previously been made. By considering balanced 
fairness, the connection between insensitivity and proportional fairness has 
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been given by Massoulie [17]. Also, motivated by diffusion approximation 
behavior in queueing networks, Kang, Kelly, Lee and Williams [9, 10, 16] 
considered heavy-traffic approximations of stochastic flow level models op- 
erating under proportional fairness and found them to have a product form 
stationary distribution. 

A final observation is that the multi-class queueing networks considered 
here have no prescribed optimization structure. Thus it is surprising to see 
that asymptotically these networks are implicitly solving a utility optimiza- 
tion problem. 

1.1. Organization. The sections of the paper are structured as follows. In 
the next subsection we introduce notation, define proportionally fair band- 
width allocations and the proportionally fair optimization problem. In Sec- 
tion 2 we define the multi-class queueing networks considered in this paper 
and state results on their stationary distribution. In Section 3 we introduce 
stochastic flow level models, define the spinning network and state Theo- 
rem 3.1. In Section 4 we discuss the reversibility, insensitivity, stability and 
stationary behavior of the spinning network. In Section 5 we present the ar- 
gument given by Schweitzer [23], Kelly [13] and Massoulie and Roberts [19] 
relating proportional fairness and multi-class queueing networks. In Section 

6 we consider the large deviations behavior of the multi-class single-server 
queueing networks presented in Section 2 and prove Theorem 6.1. In Section 

7 we discuss the collapse in state space brought about by large deviations 
principle Theorem 6.1 and prove convergence of stationary throughput in 
Theorem 7.2. In the Appendix we give the proof of Theorem 3.1 and prove 
additional lemmas and propositions from within the text. 

1.2. Notation: Network structure and proportional fairness. We let a fl- 
nite set JT" index the set of queues in a network. Let J = \J'\. A route 
through the network is a nonempty set of queues. Let I C 2'^ be the set 
of routes. Let I = For each route i = . . . G T, we associate an 
order (ji, . . . , Jfc.). Also we define the set of queue-route incidences, IC := 
{{j,i) -i G I,j & iJ,j G i} and let K = |/C|. We will view our multi-class 
queueing network model as transferring a number of documents across the 
different routes of the network. In Sections 2, 3, 4 and Appendix A.l, the 
vector n = [ui : i G X) € will be used to refer to the number of documents 
in transfer across the routes of the network. When referring to large devia- 
tions characteristics, in Sections 6 and 7, n = {ui : i G X) € will be used 
to refer to the proportion of documents in transfer across routes. We will 
consider the documents in our multi-class queueing network to be trans- 
ferred by packets which will traverse the network. Each document on each 
route will have only one packet in transfer across the network at any point 
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in time. Thus, given our description of n, the total number (or proportion) 
of packets in transfer across route i will be nj. We also consider the vector 
m = {rriji : G /C) € M;^. In Sections 2, 3, 4 and Appendix A.l, rriji € 
will be used to refer to the number of packets in transfer across route i that 
are at queue j. Similarly in Sections 6 and 7, rriji G ^+ will be used to refer 
to the proportion of the packets in transfer, that are on route i and in queue 
j. Because each packet in transfer will correspond to a document we have 
that 

rij = ^ rriji Vi € I. 

We define the number (or proportion) of packets at a queue to be 

nij := ^ rriji Vj G J. 

For each n E we define S{n) = {m G : J2j ■ jei iT^-ji = ni\/i Gl}, the set 
of queue states achievable given the number of documents in transfer. 

With each queue j £ we associate a service capacity Cj . A bandwidth 
allocation is a vector A(n) = (Aj(n) : i € X) € for each n G Z^. A band- 
width allocation is said to be feasible, if Vn G 

J2 ^ Cj Vj G J. 

A bandwidth allocation corresponds to the rate documents are transferred 
across their route given the number of documents in transfer. Bandwidth 
allocations form an abstraction of the stationary transfer rate achieved by 
rate control algorithms used, for example, in the internet. 

A bandwidth allocation A^^(n) is proportionally fair [15] if Vn G M;?^, 
Af^(n) = when nj = and A^^(n) solves 

(1.1) maximize ^njlogAj, 

(1.2) subject to J2 ^ Cj ^ ^' 

i:jei 

(1.3) overAi>0 V?gT. 

For all vectors x G M'^ we define [x\ := ([xij,..., [x^J), the lower inte- 
ger part of each component. Unless stated otherwise ||x|| = max^/ [x^'l, the 
supremum norm. Finally, we will define that for each m G Z^ 

mjl 
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2. A microscopic queueing model. In this section we introduce our mi- 
croscopic model. Tlie queueing networks considered here are exactly the 
networks with fixed service capacity described in Sections 3.1 and 3.4 of 
Kelly [11]. We interpret these multi-class queueing networks as a Markov 
chain model of document transfer across a packet switching network. This 
interpretation has previously been considered by Massoulie and Roberts [19] 
and Bonald and Proutiere [3]. Documents wishing to be transferred across 
a network are broken into a number of packets. The document's packets are 
then sent across the network one by one, so that a new packet is sent into 
the network once the packet in the network has completed its route. We 
define our queueing model in the next three paragraphs and call it an open 
multi- class queueing network with spinning. 

We consider a network of queues indexed by the set J' queues process 
packets. Each packet moves along a fixed route from the set I. Each queue 
j ^ J may store an infinite number of packets and has a fixed service ca- 
pacity Cj G (0, oo). Each packet at a queue has a position in that queue, for 
example, if there are ruj packets at queue j then these packets are stored in 
positions k = 1, . . . ,mj. The total service capacity of a queue is then divided 
between the different packets at the queue. Each queue operates under a 
service discipline that cannot discriminate between the routes used by its 
packets. More explicitly there exists a function ^j{k,mj) that gives the pro- 
portion of service devoted to the packet in position k in queue j when there 
are ruj packets at queue j. As 7^ represents a proportion 



Similarly when joining a queue a packet may only chose its position as a 
function of the number of packets at that queue. These service disciplines 
are described in Kelly [11], pages 58-60, and, for example, include first in 
first out, last come first served, processor sharing, and symmetric queues. 

Documents for transfer on route i € T arrive as a Poisson process of rate 
Vi. Each route i document consists of a discrete number of packets. This 
number is independent, finite mean and with distribution equal to random 
variable Xj. We let = (E,Xi)~^ and let Pi = ^ for all i € T. Each packet 
has an independent, exponentially distributed mean 1 service requirement 
at each queue. 

Consider route i G T with route order ( j| , . . . , j\, ) . If we wish to transfer 
a document across route z, a packet is sent along route i. It will first join 
queue j\ . For = 1, . . . , fcj — 1 on departing queue j], a packet will join queue 
Jk+i- When a packet leaves its final queue j{i) = a new packet is sent 
along route i until all packets in the document are transferred. 
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Equivalently, one could think of each document being transferred by a 
single packet which repeats its route with some probability. This probability 
only depends on the number of repetitions of its route the packet has made 
so far. This interpretation motivates the use of the word "spinning." These 
models are equivalent to the networks with fixed service capacity considered 
in Kelly [11], Section 3.1. All the results above and all the proportional 
fairness results in Sections 6 and 7 apply to this case. 

We could explicitly describe the state of this network by recording the 
position of each packet at each queue, the route used by these packets, the 
number of repetitions such packets have made on their route and the to- 
tal number of repetitions these packets must make. As noted in [11], the 
stochastic process recording this information is a Markov chain. We will 
not be interested in this explicit description. We will be interested in sim- 
pler quantities, namely the number of documents in transfer across routes, 
the number of packets in transfer on each route at each queue and the 
throughput of packets of each route at each queue. In general, the processes 
associated with these quantities will not be Markov. 

We now consider the stationary distribution of this model. The following 
result is a direct consequence of the Theorem 3.1 of Kelly [11]. 

Proposition 2.1. An open multi-class queueing network with spinning 
is ergodic if and only if 

(2.2) Y.Pi< G J. 

When ergodic, M = {Mji : (j, i) £ IC), the number of packets in transfer across 
each route at each queue, has stationary distribution, 

,,3) r,M_,.B-.^((,„,;"',,).^(l^)■"") 

for each m G Z,^, where 
(2.4) B 

Proof. Allow the state of a packet to be given by the packet's route, 
the packet's position in its current queue, the total number of repetitions 
the packet makes of its route and the number of repetitions currently made. 
From this we have a Markov chain of the form described in Section 3.1 of 
[11]. Applying Theorem 3.1 of [11] to find the stationary distribution and 
summing over the correct states gains the result. □ 



n 



The next two corollaries are an immediate consequence of this result. 
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Corollary 2.1. N = {Ni/.i sT) the number of documents in transfer 
has stationary distribution 

(2.5) FiN = n) = ^l[pr VnGZ^, 

where we define 

M E n((,„;^3i)n(^)'"'") v„.zi. 

A closed multi-class queueing network behaves as an open multi-class 
queueing network except that document arrivals and departures are for- 
bidden (see [11]). So, the network behaves as if there is a fixed number of 
infinitely large documents in transfer. We assume throughout this paper: 

Assumption 1 . Consider the Markov chain description of a closed queue- 
ing network that records the position of each packet at each queue and each 
packet's route [11], Section 3.4. Civen the number of packets on each route, 
consider the set of all possible states of the Markov chain. We assume this 
set of states is irreducible. 

This assumption excludes reducibility issues which can only occur in 
closed queueing networks where a queue serves a single deterministically 
chosen packet. It is worth noting that if Assumption 1 is broken then there 
need not be a unique stationary distribution or a unique stationary through- 
put for the closed queueing network. 

Corollary 2.2. For a closed multi-class queueing network with n G 
documents in transfer across routes, the number of packets in transfer of each 
route at each queue has stationary distribution 

M F.(M^™)^«-n((„,.':^,,),n(^ 

for each m G Z^, where Bn is defined by (2.6). 

Finally we can characterize the stationary throughput of these closed 
multi-class queueing networks. 

Corollary 2.3. For a closed multi-class queueing network with n G Z^ 
documents in transfer across routes and with ni> 0, the stationary through- 
put of route i packets at queue j & i is 

Af(n):=%-, 
where Bn is defined by (2.6) and ei is the ith unit vector in M.i_. 



PROPORTIONAL FAIRNESS AND QUEUEING NETWORKS 



9 



Proof. The probabihty the network is in state m € is given by (2.7). 
Given the network is in state m, by Corollary 3.4 of [11], the probability in 
queue j the packet position k € {1, ... ,mj} is traversing route i is The 

throughput of the packet in position k of queue j is 'yj{k,mj)Cj. Thus the 
stationary throughput of the network is 

y yjJk,m,)C,^^T](( ^'iTTf-^"'" 



m'eS{n-ei) " «Gj ^ r : ler ^ '■ ^ ' 

We used (2.1) in the first inequality; in the second we cancelled terms and 
substituted = mir — 1 if (/,r) = (j, i) and mj^ = m/r otherwise. □ 

We will define A^'^(n) as a bandwidth allocation in the next section. 

3. Limit to a macroscopic model. For an electrical network at the macro- 
scopic level, we considered the current through the network and not the ex- 
plicit behavior of individual electrons. Similarly for a packet switched net- 
work, we may wish to consider the rate documents are transferred through 
the network and not the explicit location of packets. Stochastic flow level 
models provide such a model of document transfer as they do not explic- 
itly consider packets. In this section we justify how a series of multi-class 
queueing networks with spinning converges in the Skorohod topology to a 
stochastic flow level model. Thus we limit from a model where documents 
are transferred by sending discrete packets to a model where documents are 
transferred at a dynamic, elastic rate. 

A key quantity in this analysis will be the stationary throughput of a 
closed multi-class queueing network. 

Definition 1 (Spinning allocation). For all n € iJj^^ the spinning allo- 
cation, denoted A^'^(n) = (A^^(n) :i S T), is the stationary throughput of 
packets on each route of a closed multi-class queueing network with n doc- 
uments in transfer. More explicitly from Corollary 2.3 we know that, Mi^Z 
and n € Z^ 

0, otherwise. 
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where Bn is defined by (2.6). 

Under the name "The store-forward ahocation," the spinning allocation 
is cited by Proutiere [21] as the first insensitive bandwidth allocation. The 
definition of this bandwidth allocation from multi-class queueing networks 
is due to Massoulie [21], Section 3.4. Our macroscopic model of interest will 
be the following. 

Definition 2 (Stochastic fiow level model). A stochastic flow level 
model operating under bandwidth allocation A(-) is a continuous-time Markov 
chain on Z{ with rates 



Definition 3 (Spinning network). The spinning network is the stochas- 
tic flow level model operating under the spinning allocation. 

Stochastic flow level models were first considered by Roberts and Mas- 
soulie [18]. This model can be interpreted as follows. Documents wishing to 
be transferred across route i arrive as a Poisson process of rate fj. These 
documents are assumed to have a size that is independent and exponentially 
distributed with mean If currently the number of documents in transfer 
across routes is given by vector n G then each document on route i is 

transferred at rate ^'^"^ . Documents are then processed at this rate until 
there is a change in the network's state, either by a document transfer being 
completed and thus leaving the network, or by a document arrival occur- 
ring. Thanks to the memoryless property of our process we need not record 
residual document sizes when an arrival or departure event occurs. 

We now introduce the sequence of multi-class queueing networks which 
we will limit to form our macroscopic model. Consider a sequence of open 
multi-class queueing networks with spinning, {(mW (t) : t e M+)}cGN. These 
networks have the same routing structure and document arrival processes as 
described in the last section. In this section, we assume for simplicity that 
each queue is processor sharing. Thus M^^^ is a Markov chain. We increase 
the rate at which packets are transferred through the network. In the cth 
network each queue j operates at service rate cCj. We also increase the size 
of documents, so as not to increase the rate that documents are transferred 
through the network. We assume route i documents are geometrically dis- 
tributed with parameter fii/c. We also let A^^"^) be the stochastic process for 



(3.2) 




if n' = n + Cj, 

if n' = n — ei and rii > 0, 

otherwise, 



e, is the ith unit vector in 
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the number of documents of each route in transfer, that is, Vi S T, Vt € M+ 

Let us consider intuitively how these networks hmit as c — > oo. Note tran- 
sitions of packets between queues occur at times of order O(^). Thus the 
number of packets sent by a route i document in transfer per unit time is 
0(c). The probabihty that a packet sent is the final packet is so the time 
until this document is transferred is — 0(c) = 0{1). Thus there is a separa- 
tion of time scales between document transfer and packet transfer. Between 
arrival and departure times the network behaves as a closed queueing net- 
work. By the ergodic theorem, for large c, this closed queueing network 
will behave close to its stationary distribution. Thus between arrival and 
departure times documents experience a transfer rate determined by the 
stationary throughput of a closed queueing network. Noting Definition 1 
and Corollary 2.3, we have defined this transfer rate to be the spinning allo- 
cation. Also, the increased rate of packet transfer and the geometric number 
of packets in a document suggests an exponential distribution limit for the 
size of documents. Thus it seems plausible that a stochastic flow level model 
would be the limit of these queueing networks and that Vn € Z;?^, the rate 
of transfer, would be determined by the spinning allocation. 

We will prove this assertion is correct. The formal statement of this con- 
vergence result is the following theorem. For our multi-class queueing net- 
work the theorem rigorises the separation of times scales assumption in 
Massoulie and Roberts [18, 19]. Here we define the Skorohod topology with 
any norm on M^. 



Theorem 3.1. For each c € N, take an open multi-class queueing net- 
work with spinning M^^^ as described above. We assume queues are processor 
sharing. Let N^°°^ denote the number of documents in transfer in the spin- 
ning network, (3.1) and (3.2). If 

as c — > oo, 

then, in the Skorohod topology on interval [0,1], 

^(c)^^(oo) asc^oo. 



Due to its technical nature this result is proven in the Appendix. The 
proof uses a coupling argument. The key idea in this proof is to let the 
internal behavior of each queueing network between arrival and departure 
times be governed by the same process while allowing the number of packets 
sent before a departure to converge almost surely. 
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Remark 1. One could use a similar model of a multi-class queueing 
network with spinning where the number of packets sent is not geomet- 
rically distributed and perform this limit. In this way one would model 
the transfer of documents of any positive distribution. Over c G N and 
these different document size distributions, the stationary distribution of 
A^C*^) will be unchanged provided the mean document size is scaled so that 
MW = (c//i:zGT). 

Remark 2. The processor sharing assumption is not needed in Theorem 
3.1. In general only Assumption 1 will be needed. We can prove Theorem 3.1 
when queues are not processor sharing by replacing M^^^ with the explicit 
Markov chain description of the queueing network outlined in Section 2. 

4. A macroscopic stochastic flow level model. In the last section we in- 
troduced stochastic flow level models and justified how one such model, the 
spinning network, formed a macroscopic model of the queueing networks 
considered in Section 2. Now we will discuss some properties of the spinning 
network. In particular, we show its stationary distribution to be insensitive 
to different document size distributions. Though we first note: 

Lemma 4.1. The spinning allocation is a feasible bandwidth allocation. 

Proof. For processor sharing queues, A?^(n) = En[^CjI[Mj > 0]], so 

EAf(n)=EE. 

i-.jai 

We can extend the definition of a stochastic flow level model so that 
the sizes of incoming documents are independent and of any positive dis- 
tribution. Information on residual document sizes would be needed for such 
processes to be Markov. Given this extension, a stochastic flow level model 
with mean document sizes given by (/^^^ : i G T) is insensitive if the station- 
ary distribution for the number of documents in transfer is the same as all 
other stochastic flow level models with the same mean document sizes. 

The stationary distribution of an open multi-class queueing network with 
spinning (2.3) depends on the distribution of the number of packets in a 
document only through mean document size (/^^^ :i G T). In this sense an 
open multi-class queueing network with spinning is insensitive. By the same 
scaling in Theorem 3.1 we could increase the network's service capacity 
and limit the discrete document size distribution to approximate continuous 
document size distributions. Under this scaling the stationary distribution 
(2.3) still depends on the distribution of the number of packets in a document 



C,l\M,>Q] 



■ CjFn{Mj >0) <Cj. 



n 
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only through parameters (/Xj ■.i&I). Thus given Theorem 3.1, it is reasonable 
to think that its limit, the spinning network, would be insensitive. 

Bonald and Proutiere [3, 21] found that key results on the insensitivity 
of stochastic flow level models are a consequence of existing results on the 
insensitivity and reversibility of Whittle networks [22, 26]. 

Proposition 4.1 (Bonald and Proutiere [3]). An ergodic stochastic flow 
level model operating under bandwidth allocation A(-) is insensitive if and 
only if it is reversible, that is, there exists function $ : — > M+ with $(0) = 
1, $(n) =0 \/n(^Z{ and 

^*W = %7^ VnGZ^,zGT, 
$(n) ^ 

moreover, 

(4.1) 7r{n) = ^{n)l[pr, 

forms an invariant measure for the number of documents in transfer. 

We now find the stationary distribution of the spinning network and show 
it to be insensitive; this fact has been observed by Proutiere [21] and Bonald 
and Proutiere [3]. 

Proposition 4.2. The spinning network is reversible and insensitive to 
document size distributions. The spinning network is ergodic if and only if 

(4.2) T.P^< Vi G J. 

i : jei 

The spinning network has the same stationary distribution as the number of 
documents in transfer in an open multi- class queueing network with spin- 
ning, that is distribution, (2.5). 

Proof. Insensitivity and the reversible property are an immediate con- 
sequence of Proposition 4.1 and Definition 1. By Proposition 4.1 

^{n):=Bn\{pT VnGZ:^, 

is an invariant measure. The sum of 7r(-) over all states is finite if and only 
if the stability condition (4.2) holds. When finite this sum equals B given 
by (2.4) thus giving stationary distribution (2.5). □ 
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5. Relating multi-class queueing networks to proportional fairness. In 

1979, Schweitzer [23] studied approximations of closed multi-class queueing 
networks and considered how asymptotic conditions on such networks might 
satisfy the Kuhn-Tucker conditions for proportionally fair optimization. In 
1989, Kelly [13] studied approximations of closed queueing networks and 
by an analogous analysis considered a similar optimization formulation. In 
1999, Massoulie and Roberts [19] studied a fluid-type queueing model and 
used these same Kuhn-Tucker conditions to deduce proportional fairness. To 
develop intuition and to motivate Sections 6 and 7, we present the argument 
used in these three papers. 

As given in Section 2, consider a closed multi-class queueing network with 
Hi documents in transfer on each route i €l. Let qj be the mean sojourn 
time of a packet at queue j; let fhji be the mean number of route i packets 
in transfer at queue j, and let Aj be the mean sending rate of route i packets 
into the network. By Little's law, 

(5.1) AiQj = fhji V(j, i) e IC. 
Summing over j € i and rearranging gives 

(5.2) _i_^g. = o Vie J. 

Since queues are stable we know 

(5.3) E ^ ^ ^- 

One can imagine if equality (5.3) is strict then qj ~ 0. Thus approximately 

(5.4) qj{Cj-Y.A^=Q ^j&J. 

Also 

(5.5) qj > Vj e and A^ > Vi e Z. 

Interpreting [qj '.j^J) as Lagrange multipliers, (5.2)-(5.5) are precisely the 
Kuhn-Tucker conditions for the proportionally fair optimization problem 

max > rij log Aj subject to > Aj < Cj '^j ^ J ■ 

So from this one can deduce that Aj = Af^(n) Vi e X. 

To make this argument we assumed that the sojourn times of packets did 
not depend on the route used and that complementary slackness condition 
(5.4) held. Neither of these conditions need be true in general. In fact, from 
Corollary 2.3, know that Aj = A?^(?i), the spinning allocation. In general 
A^'^(n) ^ A?^(n). Even so it is reasonable to assume A?'^(n) ~ Af^(n). 
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In the following two sections we rigorously prove a relationship between 
multi-class queueing networks and proportional fairness. We consider a multi- 
class network of single-server queues, as described in Section 2. We let the 
number of documents in transfer get large but in proportion to some fixed 
vector n S We show that these multi-class queueing networks asymp- 
totically allocate service across routes as a proportionally fair optimizer. For 
example, we will prove that for all i €l 

fi— >cx> 

6. Limit to a teleological description. In the Introduction we noted how 
minimizing energy dissipation gave an optimization description of an elec- 
trical network. In this section we wish to justify how proportional fairness 
provides an optimization description for the open multi-class queueing net- 
works discussed in Section 2. 

To do this we allow the number of documents in transfer to be large and 
in proportion to some fixed vector. The main result in this section is The- 
orem 6.1 where we prove a large deviation principle for stochastic models 
with stationary distribution (2.5). This stationary distribution includes the 
number of documents in transfer for all open multi-class queueing networks 
discussed in Section 2 and the spinning network with any finite mean docu- 
ment size distribution. This large deviations approach for queueing networks 
is similar to that given by Pittel [20], although Pittel does not consider a 
relationship with proportional fairness in his analysis. In addition, as we 
consider the large deviations of an insensitive stochastic flow level model 
this approach is also similar to that taken by Massoulie [17] for balanced 
fairness. The large deviation rate function found in Theorem 6.1 is 

(6.1) ap(n) = max ^ njlog— subject to ^ Aj < Vj G ^. 

When optimizing the above expression we can express the pi terms as ad- 
ditive constants. Thus the argument maximizing this optimization problem 
is the proportionally fair allocation A^^(?i). From this we see that these 
queueing models are related to proportionally fair optimization. 

To prove Theorem 6.1, first we prove a large deviation principle for the 
stationary distribution (2.3). Stirling's formula finds a rate function /3p(-)- 
Applying the contraction mapping principle gives the large deviation prin- 
ciple for the number of documents in transfer and finds ap(-) expressed as 
the primal of a convex optimization problem. We calculate the dual of this 
optimization problem and find it to be of the form of (6.1). 

We start by finding rate function /3p(-). 
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Lemma 6.1. Suppose M is a random variable in Z,^ with distribution 
(2.3). If we take a vector m € and take {d^'^^}h£n a sequence of vectors 
in R-'^ such that km + d^'^) S and sup^ \d^^'''>\ < oo then 



where we define 
(6.2) 



lim \\og¥{M = hm + d^^'^) = -I3p{m), 



(3p{m):= V mjtlog ^^'^^ . 



Proof. For all j G J, define df"^ =J2i:jeid^ji^- By Stirling's formula, 
lim ^logF{M = hm + d^'''>) 



lim — 

h—>oo h 



Y^logihmj+d'f^y. 



J2 log(/im,, + 4^))! 

(j,i)e'C 



lim — 

h—>co h 



^ ((/im,- + df^)\og{hm, + - {hnij + 0) 



^ {{hmji + log(/im,-i + 4f ) - {hnij, + 4f )) 



hm 2^ log — = -Pp[m). 



h^oo 



{j,i)6/C: 
mji>0 



{nij + df^ /h)pi 



□ 



Remark 3. The Kullback-Leibler divergence or relative entropy of dis- 
tributions p and (7 on X is 



D{p\\q)=YPilogl- 
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In our definition of /5p(-), if we define for each j , p' = {-^ :i3 j) and 
^^ = (v^-^3-?')' then 



Pp{^)= rnjD{p'\\q')+ ^ jn^log 



So /?p(-) is a linear combination of Kullback-Leibler divergences. Normally 
we consider proportional fairness to maximize utility subject to constraints 
on flows. The duality given in Theorem 6.1 motivates us instead to view 
proportional fairness as minimizing entropy subject to constraints on pack- 
ets. 

Note that, since x\ogx is a continuous function, (5p{-) is a continuous 
function. We define xlogx := when x = 0. Note also Lemma 6.1 applies 
for d^'*) = hm — \ hm\ . From this lemma the following result is reasonable. 

Proposition 6.1. If M is a random variable in with distribution 
(2.3), then, as h ^ oo, {j[}hem obeys a large deviation principle on M.^ with 
good rate function f3p{-). That is for all D C M;^ Borel measurable 

M \ ,. . ^{M 



- inf < liminf logP — G D < lim sup log P — € -D 

meD° /i-»oo \ h J h-^OD \ h 

< - inf f3p{m), 

where D° is the interior of D and D is the closure of D. 

A proof of this proposition can be found in the Appendix. To prove the 
main theorem of this section we require two lemmas. 

Lemma 6.2. For all A e {0,ooy , 



inf /3a (m) 



0, ^/ E ^^^C-j-'Vie J, 



-OO, otherwise. 

Lemma 6.3. For a// A G (0, oo)^, /3a(-) is a convex function. 

For proofs of Lemmas 6.2 and 6.3 see the Appendix. We now prove the 
main theorem of this section. We use the contraction principle (see [6], page 
126). Recall that distribution (2.5) is the stationary distribution for the 
number of documents in transfer for all open multi-class queueing networks 
discussed in Section 2 and the spinning network. The following theorem 
expresses an important duality between network state and network flow. 
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Theorem 6.1. If N is a random variable in iJj^ with distribution (2.5) 
then as h ^ oo, {-^jhGN obeys a large deviation principle on M.i^ with good 
rate function 

an(n) := min m,jlog — it—l subject to 

+ 0,«)e/C: ■'^ 

m,>0 

(6.3) 

niji = ni Mi 

(6.4) = max "S^ Uilog— subject to Aj < Cj Mi G J. 

That is for all A C Borel measurable, we have that 

[N \ [N \ 

— inf apin) < liminf logP — S ^ < limsuplogP — ^ ^ inf_ap(n), 

where A° is the interior of A and A is the closure of A. 

Proof. Apply the contraction principle to Proposition 6.1 using con- 
tinuous map /:M^ — > such that /(m) = (X]j: jei'^i* S 2r). This gives 
that {"^j/igN obeys a large deviation principle with good rate function 

ap{n)= min ^ niji log ^^^^^ subject to ^ mji = ni \/i&l. 

mj>0 

By Lemma 6.3, this is a convex optimization problem. Let us calculate its 
dual formulation. Taking Lagrange multipliers A S , its Lagrangian is 

L{m,\)= "'i^^°S?f^+ J2 XiUi-Y^mji 

{j,i)£K: i:ni>0 ^ 

mj>0,ni>0 

= V rujilog ^^'^^ + V x^m. 

mj>0,ni>0 

By Lemma 6.2, 

^ n,Xi, if J2 Pie^'<Cj,yjeJ, 



min L{m, A) 



i:n,>0 i-.jGi 

— OO, otherwise. 



Thus we find dual 

ap(n) = max ^ rijAi subject to ^ Pie^^ <Cj. 

i : ni>0 i : j£i 
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Substituting Aj = pie^^ gives 

Q(,(n) = max rii log — subject to Aj < Cj Vj € J ■ 

7. Teleological description. In Section 6 we saw that a proportionally 
fair rate function determined the large deviations behavior of the station- 
ary distribution of an open multi-class queueing network. In this section we 
discuss what this means for the behavior of packets and for the rate of doc- 
ument transfer in these networks. The queueing networks defined in Section 
2 have no prescribed optimization structure. Even so, as the number of doc- 
uments gets large, a network will be restricted to its most probable states, 
and because of this the network behaves as an optimizer. This notion of a 
queueing network collapsing to its most probable states is analogous to the 
heavy-traffic notion of state space collapse [4, 16, 24]. 

As in Section 6, we study the limit as the number of documents in transfer 
gets large but in proportion to some fixed vector. In this section, we charac- 
terize the solutions to the primal problem (6.3). In Theorem 7.1 show that 
the state of packets in an open multi-class queueing network with spinning 
converges in probability to the set of solutions of the primal problem. In 
Corollary 7.1, we show that at each queue the number of packets in trans- 
fer on each route converges in L} to a proportionally fair proportion of the 
number of packets at the queue. Finally in Theorem 7.2, we show that the 
stationary rate documents are transferred through these queueing networks 
converges to a proportionally fair allocation. We define 

(j,i)e/C: 

mj>0 

We now characterize the solutions to the primal problem (6.3). 

Proposition 7.1. Given n G and m* G M^' such that for all i el, 
'^^i = then m* solves primal problem 

(7.1) min p(m) subject to }^mji=ni ViGT, 
i/V(j, i)G/C 

(7.2) m*,Cj=m*Af^{n). 

Proof. Suppose m* minimizes (7.1). We show (7.2) holds for some fixed 
(j, i) G /C. If m* = then m*^ = thus (7.2) holds. Now assume that m* > 0, 
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by the strong duality of primal (6.3) and dual (6.4) we have 
Y n^logA^ 



r : nr>0 

= min V m;rlog— C/+ V log A^^(?z) ( - V m;, 

m;>0,nr>0 

and also m* minimizes this Lagrangian problem. The above expression gives 

E\ - mir mirCi 
/ log . D„, , = 0. 

nr>0 

By Lemma A. 5, if m* > then Vi 3 j, m*jfij = m|Af^(n), so (7.2) holds. 
Now we prove the converse. If (7.2) holds then Vj G J such that m* > 

-^^ = K (n), 

therefore, 

/?("i*) = E E "^FrlogAP^(n)= nr\ogh^/{n). 



rSXl-.ler r:nr>0 



□ 



We define VnSM^, 



M{n) = im G M:^ : mjiCj = mjAf^{n) V(i, i) G /C, ^ mji = Vz G j|. 

In heavy-traffic literature A4{n) would be thought of as an invariant man- 
ifold. Note that if a network of processor sharing queues were in state 
m* G Ai{n), then the transfer rate allocated to route i packets would be 
Af^(n). Since rare events occur in the most likely way, one may expect, as 
the number of documents in transfer gets large, that the state of the queues 
in the network will be close to A4{n). Thus the rate of document transfer 
will be close to a proportionally fair allocation. We use this intuition to prove 
the next theorem but first we will require a lemma. Lemma 7.1 shows that 
the rate decay in probability is larger away from the manifold A4{n). 

Lemma 7.1. For all e > 0, 3/(e) > and 6{e) > such that \/6 < 6{e) 

i:ni>0 



Where we define Ve > and 6 >0 

Pis'-— ^i'^ PpiiT^) subject to max 



E 



Tflji Til 



<6 
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and 

inf \\m — m'W > e. 

m'eM(n) 

See the Appendix for a proof of this result. We could interpret the follow- 
ing result as a state space collapse result [4, 16]; the result shows that the 
state of the queueing network converges in probability to the invariant man- 
ifold. Recall that all open queueing networks in Section 2 have stationary 
distribution (2.3). 

Theorem 7.1. Let M be any stochastic process on M.^ with stationary 
distribution (2.3) and let Ni = X^jgi ^ji^ Vi € X then Vn € and e > 

M 

m' >£ N = [hn\ ) > 0. 



PI inf 



h 



Proof. For all n € M;^ and V5 > 0, one has that eventually in h, 
M 



inf 

m'&M{n) 



h 



m 



>£ 



N = [hn] 



B 



(7.3) 



< 



B 



\hni\ ' 



PI inf 



< 



B[hn\ YlidlP 

B 



\hni\ ' 



PI inf 

m'gA4(n) 



M 

T 

M 
~h 



< 



B 



m 



m 



fie) 



>e,N= [hn]^ 



N 



n 



<6 



ex.pl -h ^ rijlog 

^ i:n.i>0 



We used Proposition 6.1 for the second inequality and Lemma 7.1 for the 
final inequality. Taking any m* G A4{n), there exists a sequence m^'^^ with 

/imW GZf , Ej:j&hm^'l^ = [hn,\ ViGTand l/imj-^^ - /im*J < 2 V(j, i) G /C. 
So by Lemma 6.1 one has 

lim ilogP(M = /im('')) = -/3„(?n*) = - V riilog^^^^. 

h-*oo h 

Thus 3h' such that V/i > h' 



i : ni>0 



(7.4) P(M = /imW) >exp(-/i ^ n.log^^i—^ - 



Pi 
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Hence one has that, eventuaUy in h, 

M 



inf 

m'eM{n) 
< 



h 



m 



>e 



N = [hn\ 



B 



B[hni YlieiPi 



[hriil 



X exp 



m log 



Ar(n) 



fie) 



i : 7ii>0 



< 



B 



[hriil 



Bihn] UieiPi 
: P(M = hm^''^\N = [hn\ )e-^f(^^/^ 



h—>oo 



►0. 



We used (7.3) for the first inequahty and (7.4) for the second inequahty. □ 

Remark 4. Pittel [20] proves Theorem 7.1 under the assumption that 
the manifold Ai{n) consists of a single point. This assumption meant Pittel 
did not require a result like Proposition 7.1. More generally Pittel's argu- 
ments do not make the primal-dual connection between queueing networks 
and proportional fairness. Other networks have a similar primal-dual large 
deviations connection between the network state and network flow (e.g., 
Loss Networks, see Kelly [12]). 

We now consider what Theorem 7.1 implies for packets at each queue. 

Corollary 7.1. V?i G Mi and G /C 



-E[\Mj,C, - M,-AP (n)||iV = [hn]] > 



0. 



Proof. Take e > 0, by Theorem 7.1 and as nijiCj = mjAf^(n) Vm G 
M(n) 



^E[\M,,C,-MX''in)\\N=lhn\] 



1' 



-E 



n 



MjiCj-MjAf^{n] 
MjiCj -MjAf^{n)\I 



inf 



M 



m 



> . 



inf 

m'£M{n) 



M 



m 



< £ 



N = \ hn\ 
N = \ hn\ 



<2^n, C.P 
il + I)Cje. 



sup 



M 

T 



m 



>£ 



N= [hn\ +{l+I)Cje 
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For the first term before the inequality we use that Mji < Mj < J2i3j 
and Af^{n) < Cj. For the second term we use that \Mji — hm'j^\ < he and 
that m' satisfies (7.2). Since e is arbitrary the resuh holds. □ 



The following result is the formal statement of (5.1) in Section 5. 



Lemma 7.2. For all G /C and for all n G with rii > 0, 



(7.5) 



AfN(n)E 





N = n — ei 


[ C, 





Proof. Since E„_eJ-^^^^T^] is the expected sojourn of a route i packet 
at queue j the above result is really a statement of Little's law. We can show 
the result by explicit calculation, 



Ar( 



n 



Bn 1 

-Bn—ei Bn 



J2 "^oi n 

m&S{n): l&J 
mji>0 



mir :r 3 I 



n 

r : l£r 



m - 5ij 

mir - 6irji:rBlJ J-J: \Ci 



n A 



:r3 I 



n 

r : Idr 



We define 5x,y '■= 1 for x = y and 5x,y ■= otherwise. Above we canceled 
terms and substituted mj^ = mir- — Sirji ■ □ 



Consider any of the multi-class queueing networks considered in Section 2. 
Recall from Corollary 2.3 that, given the number of documents in transfer is 
n, the stationary rate route i documents are transferred through the network 
is A^^(n), the spinning allocation. We now prove that this rate converges to 
a proportionally fair bandwidth allocation. In this asymptotic sense these 
queueing networks behave as a proportionally fair optimizer. 

Theorem 7.2. For all n e and Mi el 

Af(L/^nJ)— ^Ar(n). 

n-— >oo 
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Proof. If = the result is trivially true, so we assume iii > 0. By the 
arguments in Theorem 7.1 and Corollary 7.1, one can see that 



-n\MjiCj - M,-AP (n)||iV = [hn\ - a] > 

n h—foo 



0; 



therefore, 



1 



E 



M,,-^Af''{n)N=[hn\-e, 



Summing over j € i gives 



rii 



h 



E 



N = lhn\ - Ci 



h—>oo 



h— >oo 



'0. 



substituting expression (7.5) gives 
and finally rearranging gives the result 

n— +00 



□ 



APPENDIX 

A.l. Proof of convergence to the spinning network. The proof provided 
here gives the first rigorous proof of a packet level model of document trans- 
fer converging weakly to a stochastic flow level model. 

In this section and as defined in Section 3, for c S N, let M^^^ be the 
number of packets in transfer on each route and at each queue in an open 
multi-class queueing network with spinning; let N^^^ be the number of doc- 
uments in transfer in this queueing network and let A^(°°) be the spinning 
network. Finally for c € NU {oo}, let t'''^^^ be the kth jump of that IS, 

the time of the kth document arrival or departure event. We set t'^'^'^^ := 0. 

To prove Theorem 3.1 we use a coupling argument. We prove under this 
coupling, as c — > oo, N^^^ eventually makes the same jumps as N^°°^ and 
that the associated jump times converge almost surely. This is sufficient to 
prove convergence in the Skorohod topology: 

Lemma A.l. Forc€NU{oo}, let N^^'^ -.[0,00) ^ be a nonexplosive 
jump processes with increasing jump times (t'^'^^^ : k £ If for all k G Z_)_ 

c— >oo C—tOO 

then in the Skorohod topology on [0, 1] , 
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For a proof of this result see Billingsley [1], page 137. We now work to 
form a coupling so that the first jump and jump time converge. That is, we 
will prove: 

Proposition A.l. Let n° G and for each c G N take a state m°'('^) G 
S{n^). There exists a coupling of N^°°^ with initial position hq and M^^^ 
with initial position m^'^'^^ such that, almost surely 

rUc) ,^i,(oo) ArW(^i,(c)) ,Ar(°o)(^i,(oo))_ 

c— >oo c^oo 

To prove Theorem 3.1 we will apply this result to each jump interval of 
N^^\ The proof of Proposition A.l couples each M^'^-' with a single closed 
queueing network. The result is then an application of the renewal theorem. 

As described in Section 2, let M be the number of packets in transfer 
on each route and at each queue for a closed multi-class queueing network 
with G documents in transfer and service capacities {Cj :j G J^). For 
states m G S{n^), define am to be the first time M visits state m. Since M 
is recurrent, almost surely, am < oo. As noted in Section 3, M^"^^ will behave 
as a closed queueing network until the first document arrival or departure 
time. In particular, we will define 

(A.l) M(^)(t):=M(ct + a„o,(c)) VtG[0,ri'('=)). 

The ct term ensures the correct transition rates and the o"^o,{c) ensures M^"^^ 
has the correct initial state. We will formally define t^'^^^ later. 

Let Di{t) be the number of route i packets to have been served at the 
final queue of route i in closed queueing network M by time t. By Corollary 
2.3, we know the stationary throughput of route i packets at any queue j G i 
is A?^(n''). Thus we can prove the following renewal lemma and corollary. 

Lemma A. 2. Almost surely, for all i] > 0, 

Di{ct) 



sup 

iG[0,r,] 



■Af 



Proof. Let Ri^m{t) be the number of route i packets to have been served 
at the final queue on route i by the closed queueing network M when it is in 
state m. Let ji{m) be the drift of Ri^m- For any Markov chain the process 
that records the current state of the Markov chain and the next state is also 
a Markov chain. So Ri^m is a renewal process and thus obeys the functional 
renewal theorem. That is, almost surely, for all i] > 0, 

Ri,m (ci) 



sup 

tG[0,r,] 



7i(m)t 



►0. 
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For a proof of this see Chen and Yao [5], page 106. By the definition of Di(t) 
and Corollary 2.3, we know that 

Di{t)= J2 ^^^rnit) and Af(?iO)= ^ -fi{m). 
So, almost surely, Vr? > 



sup 



Di{ct) 



< ^ sup 

m.eS(nO)*e[0,»7] 



■ji{'m)t 



-40. 



□ 



Corollary A. 2. Almost surely, 
Di{ct + a^o,{o)] 



sup 

tG[0,»7] 



Ar(nO)t 



►0. 



Proof. As M is recurrent on all states in S{n^), almost surely, am < oo 
V?7T, G 5'(n'^). Thus, by this and Lemma A. 2, almost surely, 

Di{ct + a^o,(c)] 



sup 

*6[0,7?] 



Af(n°)(t + 



+ sup 

t6[0,r,+CT^0,(c): 



c c 
Di{ct) 



Af 



>0. 



In the open queueing network M^'^'^ , we suppose each document in transfer 
on route i is geometrically distributed with parameter Thus, if > 
and assuming no other document arrival or departures occur, the time until 
the first route i document is transferred is 

5f):=inf{t:A(ct + a^o,(c)) = y/^)}, 

where Y^^^ is geometrically distributed with parameter ^ and {y}^^ : c € N) 
is independent of M. Now suppose 



(A.2) 



Y 



■Y 



(oo) 



almost surely. 



where Y-^°°^ is exponentially distributed with parameter /ij. This can be con- 
structed using the Skorohod representation theorem. Using this and Corol- 

(c) 

lary A.2 we can show that 5- converges to an exponential distribution. 



Lemma A. 3. For each i^Z such that ni > 0, almost surely 



S.. 



■s.. 



(^) 



where 3^°°^ is exponentially distributed with parameter f^iAf^ (n^) . 
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Proof. Define S, 



Y. 



(<x=) 



surely, Ve > and V?? > -jm(j;o) 
Di{ct + a^o,{c)) 



^sN(y^o) • Corollary A. 2 and (A. 2), almost 
3c' such that \/c> c' , 



sup 



y; 



(oo) 



< e. 



Hence 



c 



cY 



(oo) 



2ce 



ASN(nO) ASN(nO) 



.(oo) 



e < 



(c) 



thus 



5f ^ = inf{t > : Di{ct + a„,o,(c)) = F^'^)} > 



(oo) 



2e 



ASN(nO) ASN(nO) 



5 



(oo) 



2e 



ASN(nO)- 

By a similar argument one can see that 



2e 



ASN( 



Thus 5, 



(c) 



5'!°°'' as c— > oo, almost surely. □ 



We may choose {Y-^'^^ : c G N U {oo}) independently for each z € X with 
rii > 0. The transfer of a document on route i could be interrupted by an 
earlier document arrival or departure event. Thus letting Ei be independent 
exponentially distributed with parameter Vi for i ^I. We are interested in 
the time when the first arrival or departure time occurs, so we consider 



-Uc) .- 



min{5f ^ : > 0} A mm{Ei Vc G N U {oo}. 



By the last lemma we know that r^'W t1'(°°), as c — i- oo. The term achiev- 
ing these minima determines which arrival or departure occurs. So we may 
define our coupled process, M^^\ up until the first arrival or departure time 
by (A.l) and by 



MW(ri.W-)-e,(,)„ ifri.W=5f\ 



The term eji G is the unit vector in the {j, i) G /C direction, thus e^i j 
corresponds to the arrival of the first packet in a route i document and — ej(j)j 
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corresponds to the departure of the final packet in a route i document. Also 
we may define A^(°°) by A^(°°)(i) := for t < r^'(°°) and by 

j^ioo). 1,(00). ^^n^ + e^, ifri'(-) = E„ 

^ ^' [n^-Bi, if n./ > and ri'(°°) = 

Similarly, Cj G M:^ is the ith unit vector. This defines our coupled process up 
to the first document arrival or departure time. We can now prove Proposi- 
tion A.l. 



Proof of Proposition A.l. By Lemma A. 3 we know that, almost 
surely, 

(A.3) ri'^'^) >T^'(^\ 

c— >oo 

Also as {5'|°°^ : Tij > 0} U {Ei : z E T} are independent exponentially distributed 
random variables, almost surely, no two terms are equal. Thus due to this 
and (A.3), almost surely, eventually as c— >oo, 

argmin[{S'f ^ : > 0} U {Ei 

= argmin[{S'J°°^ :ni > 0} U {Ei:i e J}], 
and so, almost surely, as cx), Ar(c)(7-i,{c)) ^(00) (-^1,(00)^^ □ 



Proposition A.l guarantees convergence up to and including the first doc- 
ument arrival or departure time. Essentially repeating this argument con- 
structs each interval [r^'^^^r^^^^'^'^^j and proves Theorem 3.1. 



Proof of Theorem 3.1. On a single probability space we will induc- 
tively construct N^°°^ and M^^^ for c G N. On the probability space we will 
prove that the following induction hypothesis holds Vk € Z_|_: there exists a 
coupling of M^'^^ for c G N and N^°°^ up to and including the Kth document's 
arrival or departure time such that, almost surely, < k, 

(A.4) r'='(^) ,r^>i°^) and iV^^Vr^^'^^)) ,^{00) . fc,{oo)y 

c— »oo c^oo 

Let us prove that the induction hypothesis holds for the case of k = 0. 
We must find a couphng of M^^^O) and N^°°\o) so that (A.4) holds. We 
know T^'^'^^ — > r^'(°°) as c — > 00 holds as r'^'^'^) := 0. By assumption A^('^)(0) 
converges weakly to N^°°\o), so by the Skorohod representation theorem, 
we may choose (A''('^)(0) : c G N U {00}), such that, almost surely. 



A^(^)(0) .Af(°°)(0). 

C— 
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X 



Given A^^"^) we know the required distribution of M^'^\o), so take f^'^^ : 
[0,1] — > such that for a uniform random variable U, 

P(/(^)(n,C/) =7n) =P(M(^) =m|iV(^) =n) V?nGZ^,nGZ^. 
Therefore taking an independent uniform random variable we may define 

m(^)(o) = /(^)(iv(^)(o),[/) VceN. 

Thus M^^) (0) is of the correct distribution and (A. 4) holds. This proves the 
induction hypothesis is true for k = 0. 

Suppose the induction hypothesis holds for k — 1. Using Proposition A.l, 
we will show the induction hypothesis holds for k by extending the process 
jVf(c) from time t'^~^'^'^^ to the next document arrival or departure time 

By the induction hypothesis 3c' e N such that Vc > c', iv(c)(r''"^'(^)) = 



m 



«-l,(c) ^^(c)(^,.-l,(c)) VOC'. 



Now apply Proposition A.l with initial state (m'^ ^'^^^ ■.c>c') to give process 
j\^'«,(c) defined until the first document arrival or departure time t^'^^^ — 
^fc-i,(c)_ extend M^^) to include t € [r^-^'W, r'^'(^)] by defining 

M('^)(t) :=M'^'(^)(t-T'''('=)) for i € K~^'(^\r^'(^)]. 

By Proposition A.l we know (A. 4) holds for k.. This completes the induction 
step. Thus the induction hypothesis holds for all k € N. At each induction 
step we required a countable collection of independent random variables, 
thus the coupled processes M^'^) and iV(°°) can be constructed on a proba- 
bility space consisting of a countable set of independent random variables. 
Since (A. 4) holds for all A; G N, Lemma A.l gives that, almost surely, 

c^oo 

in the Skorohod topology on [0, 1] . Thus this implies weak convergence. □ 

A. 2. Proof of Proposition 6.1. To prove Proposition 6.1 we expand on 
Lemma 6.1 to make a large deviation principle. The following lemmas will 
be used. The proofs of both lemmas are simple calculus arguments. 

Lemma A. 4. For m G and j ^ J fixed with mj > 0. Maximizing 
over vectors 9 = {Oiii 3 j) > satisfying constraint jei Pi^^^ = Cj, we 
have that 

mad 



max 
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Lemma A. 5. For fixed parameters A G (0,oo)^ and j G J^. Maximizing 
over probability distributions p= {pi/.iB j), we have that 

and i/ie minimum is attained by pi = ^ ■ 

We now prove Proposition 6.1. The result makes use of the upper bound 
of the Gartner-Ehis theorem (see [6], page 44). 

Proof of Proposition 6.1. First we prove the lower bound. Take 
O C M+ open, for ah m € O 3h' such that \lh>h', G O. By this and 

Lemma 6.1 we have that 

liminf ^ logpf^ € > sup liminf logP(M = \hm\) = — inf Pp{m). 

This gives the required lower bound. 

Now we prove the upper bound. M has moment generating function 



< 



Thus 



, oo, otherwise. 



F{e):= lim ilogEe^U-)e'=''^^"^'^^'/'' 



0, if J2 pie'^' <Cj,yjeJ, 



i : jei 

oo, otherwise. 



Let F*{-) be the Legendre-Fenchel transform of F{-). By Lemma A. 4, 
F*{m) = sup ( J2 ^j^ii ■■ E Pi""''' < Cj^j ^ A 



-{j,i)€K i-jei 



: E sup <^ E (pirriji: ^ /Ojc'^' < > 
■ E E '"i^ !!!21^ = /3 
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Mm G M^. Thus the Gartner-EUis theorem gives for all closed sets C C M;^ 

limsupilogpf^ G <- inf /3p(m). 
h^oo h \ h J m&C 

Finally, we prove /9p(-) is a good rate function. /3p(-) is continuous with 
values in M+ and so is a rate function. By Lemma A. 5, Va > 0, V?n G M;^ if 

a 

logCj -logE*:jeiPi 

then 

'YYj^ • • TT\ ' ' ' ' 



J : rrij >0 i \ j£i -J •^'^ y. nij >0 

Thus 

{m G M+ : Pp(m) < a} C |m G M+ : < m,- < 

I ^OgCj -logJ2i:jeiPi. 

So all level sets are compact and hence (3p{-) is a good rate function. □ 
A.3. Proofs of additional lemmas. 
Proof of Lemma 6.2. By Lemma A. 5 



inf f3\{m) = inf — ^log — ^^-r^ 



inf m'- log : 



0, if ^ A, < Cj, 

i : jGi 

— oo, otherwise. '— ' 

Proof of Lemma 6.3. We first show /3a(-) is convex on (0, oo)^. One 
can see Vm G (0, oo)^ and V(j, i), {I, r) G /C that 

So for all vectors a G M'^^' \ {0} the second partial derivative in direction a is 



{;,r)GK: 
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We will prove that this expression is positive. Note that for fixed j G J', 



nijr , 

For all i^r, a?,^x + a^^- is convex on (0, oo) and has its minimum at x 



1^1. So 

m 



i: j(^i r : j(^r i : j r : j dr 

Dividing by rrij and summing over j ^ J gives that 

5^/3a _ ajiajr \ ^ ^ 



da? 



. „ \ . . . m,-, ... 

This proves /3a(-) is convex on (0,oo)''^. 

Take m,?fi G and two sequences, rrS'^\'ffS^^ G (0, oo)^, that converge 
to m and m, respectively. By continuity of Pa{-), V6' G (0, 1) 

/3a (^m + (1 - = lim ()A{Om^^^ + (1 - 0)m('')) 

h— ►oo 

< lim ef3K{m^^^) + {l-9)(3A{'m^^^) 

h~*oo 

= e/3A{m) + {l-e)PA{m), 
thus Pa{-) is convex on M^. □ 

Proof of Lemma 7.1. It is clear that /3*^ is nonincreasing in J. We 
now claim that /?£ ^ /' q as 5 \ 0. If this were not so by compactness of 
our optimization region, we could choose 5^ \ and m'^ G M5: , with 



max 



<6k, inf \\m -m'||>e, sup/3p(mfc) < /3*o, 

m'eM(n) ' 



and such that for some m G M;^, m'^ — > m as /c ^ cx). But then by the conti- 
nuity of Pp, Pp{m) < /3*Q. This then contradicts the minimality of /3*q. So, 
/3*,//?*oas5\0. 

A^^(n) 

By strong duality of (6.3) and (6.4), /3o^o = Ei: n,>o ^» log " p- ■ I^p is 
continuous and M.{n) is compact, /3*o > /3oo > 0. Take any /(e) < /3*o — 
/3o,o- By the last paragraph, 3(5(e) such that V(5 < (5(e), /3* o - /?*5 < 2/(e). 
This gives that /3*g - /3*5 < /(e) + /3*_o - Pq^q, and hence the result. □ 
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